Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 8522 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.4 MiB |
| Average record size in memory | 176.0 B |
Variable types
| Numeric | 21 |
|---|---|
| Categorical | 1 |
NumRadicalElectrons has constant value "0" | Constant |
df_index is highly correlated with NumRadicalElectrons | High correlation |
NumValenceElectrons is highly correlated with HeavyAtomCount and 5 other fields | High correlation |
HeavyAtomCount is highly correlated with NumValenceElectrons and 5 other fields | High correlation |
NHOHCount is highly correlated with NumHDonors | High correlation |
NOCount is highly correlated with NumValenceElectrons and 4 other fields | High correlation |
NumAliphaticCarbocycles is highly correlated with NumAliphaticRings and 2 other fields | High correlation |
NumAromaticCarbocycles is highly correlated with NumAromaticRings | High correlation |
NumAromaticHeterocycles is highly correlated with NumAromaticRings and 2 other fields | High correlation |
NumAliphaticHeterocycles is highly correlated with NumAliphaticRings and 2 other fields | High correlation |
NumAliphaticRings is highly correlated with NumAliphaticCarbocycles and 5 other fields | High correlation |
NumAromaticRings is highly correlated with NumAromaticCarbocycles and 2 other fields | High correlation |
NumHAcceptors is highly correlated with NumValenceElectrons and 4 other fields | High correlation |
NumHDonors is highly correlated with NHOHCount | High correlation |
NumHeteroatoms is highly correlated with NumValenceElectrons and 3 other fields | High correlation |
NumRotatableBonds is highly correlated with NumValenceElectrons and 1 other fields | High correlation |
NumSaturatedCarbocycles is highly correlated with NumAliphaticCarbocycles and 3 other fields | High correlation |
NumSaturatedHeterocycles is highly correlated with NumAliphaticHeterocycles and 2 other fields | High correlation |
NumSaturatedRings is highly correlated with NumAliphaticCarbocycles and 5 other fields | High correlation |
RingCount is highly correlated with NumValenceElectrons and 5 other fields | High correlation |
fr_Ar_N is highly correlated with NumAromaticHeterocycles and 1 other fields | High correlation |
fr_NH0 is highly correlated with NOCount and 3 other fields | High correlation |
NumRadicalElectrons is highly correlated with df_index and 20 other fields | High correlation |
df_index has unique values | Unique |
NHOHCount has 2458 (28.8%) zeros | Zeros |
NumAliphaticCarbocycles has 7276 (85.4%) zeros | Zeros |
NumAromaticCarbocycles has 1532 (18.0%) zeros | Zeros |
NumAromaticHeterocycles has 3177 (37.3%) zeros | Zeros |
NumAliphaticHeterocycles has 4776 (56.0%) zeros | Zeros |
NumAliphaticRings has 4105 (48.2%) zeros | Zeros |
NumAromaticRings has 604 (7.1%) zeros | Zeros |
NumHDonors has 2457 (28.8%) zeros | Zeros |
NumRotatableBonds has 174 (2.0%) zeros | Zeros |
NumSaturatedCarbocycles has 7697 (90.3%) zeros | Zeros |
NumSaturatedHeterocycles has 5797 (68.0%) zeros | Zeros |
NumSaturatedRings has 5305 (62.3%) zeros | Zeros |
RingCount has 172 (2.0%) zeros | Zeros |
fr_Ar_N has 3698 (43.4%) zeros | Zeros |
fr_NH0 has 1608 (18.9%) zeros | Zeros |
Reproduction
| Analysis started | 2022-11-04 07:13:42.917659 |
|---|---|
| Analysis finished | 2022-11-04 07:14:52.199803 |
| Duration | 1 minute and 9.28 seconds |
| Software version | pandas-profiling v3.4.0 |
| Download configuration | config.json |
| Distinct | 8522 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6220.515372 |
| Minimum | 0 |
|---|---|
| Maximum | 12664 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 591.05 |
| Q1 | 3023.25 |
| median | 6158 |
| Q3 | 9418.5 |
| 95-th percentile | 11950.95 |
| Maximum | 12664 |
| Range | 12664 |
| Interquartile range (IQR) | 6395.25 |
Descriptive statistics
| Standard deviation | 3657.676603 |
|---|---|
| Coefficient of variation (CV) | 0.5880021805 |
| Kurtosis | -1.216095087 |
| Mean | 6220.515372 |
| Median Absolute Deviation (MAD) | 3197 |
| Skewness | 0.0250161244 |
| Sum | 53011232 |
| Variance | 13378598.13 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5124 | 1 | < 0.1% |
| 9906 | 1 | < 0.1% |
| 6421 | 1 | < 0.1% |
| 6402 | 1 | < 0.1% |
| 9661 | 1 | < 0.1% |
| 2744 | 1 | < 0.1% |
| 5949 | 1 | < 0.1% |
| 4309 | 1 | < 0.1% |
| 7760 | 1 | < 0.1% |
| 2317 | 1 | < 0.1% |
| Other values (8512) | 8512 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 8 | 1 | |
| 10 | 1 | |
| 12 | 1 | |
| 14 | 1 | |
| 15 | 1 | |
| 17 | 1 | |
| 18 | 1 |
| Value | Count | Frequency (%) |
| 12664 | 1 | |
| 12663 | 1 | |
| 12661 | 1 | |
| 12660 | 1 | |
| 12659 | 1 | |
| 12658 | 1 | |
| 12657 | 1 | |
| 12656 | 1 | |
| 12654 | 1 | |
| 12653 | 1 |
| Distinct | 118 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 126.0330908 |
| Minimum | 8 |
|---|---|
| Maximum | 292 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 8 |
|---|---|
| 5-th percentile | 74 |
| Q1 | 106 |
| median | 126 |
| Q3 | 146 |
| 95-th percentile | 176 |
| Maximum | 292 |
| Range | 284 |
| Interquartile range (IQR) | 40 |
Descriptive statistics
| Standard deviation | 31.63374983 |
|---|---|
| Coefficient of variation (CV) | 0.2509955887 |
| Kurtosis | 0.4907922051 |
| Mean | 126.0330908 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 0.1288689652 |
| Sum | 1074054 |
| Variance | 1000.694128 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 118 | 242 | 2.8% |
| 128 | 231 | 2.7% |
| 122 | 225 | 2.6% |
| 134 | 224 | 2.6% |
| 126 | 222 | 2.6% |
| 110 | 221 | 2.6% |
| 140 | 221 | 2.6% |
| 112 | 218 | 2.6% |
| 116 | 215 | 2.5% |
| 124 | 214 | 2.5% |
| Other values (108) | 6289 |
| Value | Count | Frequency (%) |
| 8 | 2 | < 0.1% |
| 14 | 1 | < 0.1% |
| 24 | 1 | < 0.1% |
| 26 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 30 | 3 | |
| 32 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 36 | 7 | |
| 38 | 6 |
| Value | Count | Frequency (%) |
| 292 | 1 | < 0.1% |
| 272 | 2 | < 0.1% |
| 262 | 1 | < 0.1% |
| 260 | 2 | < 0.1% |
| 256 | 1 | < 0.1% |
| 254 | 2 | < 0.1% |
| 250 | 3 | |
| 246 | 1 | < 0.1% |
| 244 | 1 | < 0.1% |
| 238 | 6 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8522 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 8522 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8522 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8522 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8522 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8522 |
| Distinct | 51 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.15524525 |
| Minimum | 1 |
|---|---|
| Maximum | 55 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 14 |
| Q1 | 20 |
| median | 24 |
| Q3 | 28 |
| 95-th percentile | 34 |
| Maximum | 55 |
| Range | 54 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 6.1574385 |
|---|---|
| Coefficient of variation (CV) | 0.2549110322 |
| Kurtosis | 0.4338424686 |
| Mean | 24.15524525 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 0.007862023607 |
| Sum | 205851 |
| Variance | 37.91404888 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24 | 570 | 6.7% |
| 26 | 551 | 6.5% |
| 22 | 548 | 6.4% |
| 23 | 534 | 6.3% |
| 25 | 524 | 6.1% |
| 21 | 512 | 6.0% |
| 27 | 489 | 5.7% |
| 20 | 459 | 5.4% |
| 28 | 441 | 5.2% |
| 29 | 428 | 5.0% |
| Other values (41) | 3466 |
| Value | Count | Frequency (%) |
| 1 | 2 | < 0.1% |
| 2 | 1 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 4 | < 0.1% |
| 6 | 10 | 0.1% |
| 7 | 16 | 0.2% |
| 8 | 20 | 0.2% |
| 9 | 35 | |
| 10 | 56 | |
| 11 | 43 |
| Value | Count | Frequency (%) |
| 55 | 1 | < 0.1% |
| 53 | 1 | < 0.1% |
| 50 | 4 | < 0.1% |
| 49 | 3 | < 0.1% |
| 48 | 2 | < 0.1% |
| 47 | 1 | < 0.1% |
| 46 | 7 | |
| 45 | 3 | < 0.1% |
| 44 | 4 | < 0.1% |
| 43 | 10 |
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.256043182 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 2458 |
| Zeros (%) | 28.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.225502289 |
|---|---|
| Coefficient of variation (CV) | 0.9756848377 |
| Kurtosis | 3.497259858 |
| Mean | 1.256043182 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.4899086 |
| Sum | 10704 |
| Variance | 1.501855859 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3281 | |
| 0 | 2458 | |
| 2 | 1688 | |
| 3 | 652 | 7.7% |
| 4 | 250 | 2.9% |
| 5 | 112 | 1.3% |
| 6 | 52 | 0.6% |
| 7 | 16 | 0.2% |
| 8 | 11 | 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2458 | |
| 1 | 3281 | |
| 2 | 1688 | |
| 3 | 652 | 7.7% |
| 4 | 250 | 2.9% |
| 5 | 112 | 1.3% |
| 6 | 52 | 0.6% |
| 7 | 16 | 0.2% |
| 8 | 11 | 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 10 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 11 | 0.1% |
| 7 | 16 | 0.2% |
| 6 | 52 | 0.6% |
| 5 | 112 | 1.3% |
| 4 | 250 | 2.9% |
| 3 | 652 | 7.7% |
| 2 | 1688 | |
| 1 | 3281 |
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.461746069 |
| Minimum | 0 |
|---|---|
| Maximum | 15 |
| Zeros | 7 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 5 |
| Q3 | 7 |
| 95-th percentile | 9 |
| Maximum | 15 |
| Range | 15 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.022407895 |
|---|---|
| Coefficient of variation (CV) | 0.3702859616 |
| Kurtosis | 0.4127164824 |
| Mean | 5.461746069 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.4202397953 |
| Sum | 46545 |
| Variance | 4.090133695 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 1710 | |
| 6 | 1536 | |
| 4 | 1431 | |
| 7 | 1158 | |
| 3 | 874 | |
| 8 | 695 | |
| 2 | 440 | 5.2% |
| 9 | 357 | 4.2% |
| 10 | 139 | 1.6% |
| 1 | 72 | 0.8% |
| Other values (6) | 110 | 1.3% |
| Value | Count | Frequency (%) |
| 0 | 7 | 0.1% |
| 1 | 72 | 0.8% |
| 2 | 440 | 5.2% |
| 3 | 874 | |
| 4 | 1431 | |
| 5 | 1710 | |
| 6 | 1536 | |
| 7 | 1158 | |
| 8 | 695 | |
| 9 | 357 | 4.2% |
| Value | Count | Frequency (%) |
| 15 | 4 | < 0.1% |
| 14 | 7 | 0.1% |
| 13 | 10 | 0.1% |
| 12 | 22 | 0.3% |
| 11 | 60 | 0.7% |
| 10 | 139 | 1.6% |
| 9 | 357 | 4.2% |
| 8 | 695 | |
| 7 | 1158 | |
| 6 | 1536 |
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2247125088 |
| Minimum | 0 |
|---|---|
| Maximum | 7 |
| Zeros | 7276 |
| Zeros (%) | 85.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.6630980644 |
|---|---|
| Coefficient of variation (CV) | 2.950872953 |
| Kurtosis | 19.39965591 |
| Mean | 0.2247125088 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.034303936 |
| Sum | 1915 |
| Variance | 0.439699043 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 7276 | |
| 1 | 877 | 10.3% |
| 2 | 204 | 2.4% |
| 4 | 95 | 1.1% |
| 3 | 52 | 0.6% |
| 5 | 15 | 0.2% |
| 6 | 2 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 7276 | |
| 1 | 877 | 10.3% |
| 2 | 204 | 2.4% |
| 3 | 52 | 0.6% |
| 4 | 95 | 1.1% |
| 5 | 15 | 0.2% |
| 6 | 2 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7 | 1 | < 0.1% |
| 6 | 2 | < 0.1% |
| 5 | 15 | 0.2% |
| 4 | 95 | 1.1% |
| 3 | 52 | 0.6% |
| 2 | 204 | 2.4% |
| 1 | 877 | 10.3% |
| 0 | 7276 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.330673551 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 1532 |
| Zeros (%) | 18.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.8779638678 |
|---|---|
| Coefficient of variation (CV) | 0.6597890724 |
| Kurtosis | -0.1395843453 |
| Mean | 1.330673551 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.2438943961 |
| Sum | 11340 |
| Variance | 0.7708205532 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3363 | |
| 2 | 2984 | |
| 0 | 1532 | |
| 3 | 568 | 6.7% |
| 4 | 71 | 0.8% |
| 5 | 3 | < 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1532 | |
| 1 | 3363 | |
| 2 | 2984 | |
| 3 | 568 | 6.7% |
| 4 | 71 | 0.8% |
| 5 | 3 | < 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6 | 1 | < 0.1% |
| 5 | 3 | < 0.1% |
| 4 | 71 | 0.8% |
| 3 | 568 | 6.7% |
| 2 | 2984 | |
| 1 | 3363 | |
| 0 | 1532 |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.9290072753 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 3177 |
| Zeros (%) | 37.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 0.8746081928 |
|---|---|
| Coefficient of variation (CV) | 0.9414438574 |
| Kurtosis | -0.3380616858 |
| Mean | 0.9290072753 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.5958961531 |
| Sum | 7917 |
| Variance | 0.7649394909 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3177 | |
| 1 | 3151 | |
| 2 | 1843 | |
| 3 | 325 | 3.8% |
| 4 | 25 | 0.3% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 3177 | |
| 1 | 3151 | |
| 2 | 1843 | |
| 3 | 325 | 3.8% |
| 4 | 25 | 0.3% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 1 | < 0.1% |
| 4 | 25 | 0.3% |
| 3 | 325 | 3.8% |
| 2 | 1843 | |
| 1 | 3151 | |
| 0 | 3177 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6022060549 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 4776 |
| Zeros (%) | 56.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.7872859055 |
|---|---|
| Coefficient of variation (CV) | 1.307336416 |
| Kurtosis | 1.702728474 |
| Mean | 0.6022060549 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.26706303 |
| Sum | 5132 |
| Variance | 0.6198190971 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4776 | |
| 1 | 2537 | |
| 2 | 1078 | 12.6% |
| 3 | 95 | 1.1% |
| 4 | 27 | 0.3% |
| 5 | 8 | 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 4776 | |
| 1 | 2537 | |
| 2 | 1078 | 12.6% |
| 3 | 95 | 1.1% |
| 4 | 27 | 0.3% |
| 5 | 8 | 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6 | 1 | < 0.1% |
| 5 | 8 | 0.1% |
| 4 | 27 | 0.3% |
| 3 | 95 | 1.1% |
| 2 | 1078 | 12.6% |
| 1 | 2537 | |
| 0 | 4776 |
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8269185637 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 4105 |
| Zeros (%) | 48.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.040304141 |
|---|---|
| Coefficient of variation (CV) | 1.258049083 |
| Kurtosis | 4.394591789 |
| Mean | 0.8269185637 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.701312656 |
| Sum | 7047 |
| Variance | 1.082232705 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4105 | |
| 1 | 2637 | |
| 2 | 1276 | 15.0% |
| 3 | 274 | 3.2% |
| 4 | 150 | 1.8% |
| 5 | 58 | 0.7% |
| 6 | 15 | 0.2% |
| 8 | 3 | < 0.1% |
| 7 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 4105 | |
| 1 | 2637 | |
| 2 | 1276 | 15.0% |
| 3 | 274 | 3.2% |
| 4 | 150 | 1.8% |
| 5 | 58 | 0.7% |
| 6 | 15 | 0.2% |
| 7 | 2 | < 0.1% |
| 8 | 3 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 2 | < 0.1% |
| 8 | 3 | < 0.1% |
| 7 | 2 | < 0.1% |
| 6 | 15 | 0.2% |
| 5 | 58 | 0.7% |
| 4 | 150 | 1.8% |
| 3 | 274 | 3.2% |
| 2 | 1276 | 15.0% |
| 1 | 2637 | |
| 0 | 4105 |
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.259680826 |
| Minimum | 0 |
|---|---|
| Maximum | 7 |
| Zeros | 604 |
| Zeros (%) | 7.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.160124874 |
|---|---|
| Coefficient of variation (CV) | 0.5134020967 |
| Kurtosis | -0.3522425818 |
| Mean | 2.259680826 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.01304018421 |
| Sum | 19257 |
| Variance | 1.345889723 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 2866 | |
| 3 | 2287 | |
| 1 | 1519 | |
| 4 | 1100 | 12.9% |
| 0 | 604 | 7.1% |
| 5 | 132 | 1.5% |
| 6 | 13 | 0.2% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 604 | 7.1% |
| 1 | 1519 | |
| 2 | 2866 | |
| 3 | 2287 | |
| 4 | 1100 | 12.9% |
| 5 | 132 | 1.5% |
| 6 | 13 | 0.2% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7 | 1 | < 0.1% |
| 6 | 13 | 0.2% |
| 5 | 132 | 1.5% |
| 4 | 1100 | 12.9% |
| 3 | 2287 | |
| 2 | 2866 | |
| 1 | 1519 | |
| 0 | 604 | 7.1% |
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.740553861 |
| Minimum | 0 |
|---|---|
| Maximum | 16 |
| Zeros | 16 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 5 |
| Q3 | 6 |
| 95-th percentile | 8 |
| Maximum | 16 |
| Range | 16 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.943919911 |
|---|---|
| Coefficient of variation (CV) | 0.4100617708 |
| Kurtosis | 0.2487370973 |
| Mean | 4.740553861 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.4808668791 |
| Sum | 40399 |
| Variance | 3.77882462 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 1822 | |
| 5 | 1525 | |
| 3 | 1446 | |
| 6 | 1210 | |
| 7 | 777 | |
| 2 | 743 | |
| 8 | 504 | 5.9% |
| 9 | 201 | 2.4% |
| 1 | 191 | 2.2% |
| 10 | 54 | 0.6% |
| Other values (5) | 49 | 0.6% |
| Value | Count | Frequency (%) |
| 0 | 16 | 0.2% |
| 1 | 191 | 2.2% |
| 2 | 743 | |
| 3 | 1446 | |
| 4 | 1822 | |
| 5 | 1525 | |
| 6 | 1210 | |
| 7 | 777 | |
| 8 | 504 | 5.9% |
| 9 | 201 | 2.4% |
| Value | Count | Frequency (%) |
| 16 | 2 | < 0.1% |
| 14 | 2 | < 0.1% |
| 12 | 12 | 0.1% |
| 11 | 17 | 0.2% |
| 10 | 54 | 0.6% |
| 9 | 201 | 2.4% |
| 8 | 504 | 5.9% |
| 7 | 777 | |
| 6 | 1210 | |
| 5 | 1525 |
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.137643746 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 2457 |
| Zeros (%) | 28.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.018635414 |
|---|---|
| Coefficient of variation (CV) | 0.8953905104 |
| Kurtosis | 1.760010137 |
| Mean | 1.137643746 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.043050025 |
| Sum | 9695 |
| Variance | 1.037618107 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3507 | |
| 0 | 2457 | |
| 2 | 1771 | |
| 3 | 575 | 6.7% |
| 4 | 160 | 1.9% |
| 5 | 36 | 0.4% |
| 6 | 13 | 0.2% |
| 8 | 2 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2457 | |
| 1 | 3507 | |
| 2 | 1771 | |
| 3 | 575 | 6.7% |
| 4 | 160 | 1.9% |
| 5 | 36 | 0.4% |
| 6 | 13 | 0.2% |
| 7 | 1 | < 0.1% |
| 8 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 2 | < 0.1% |
| 7 | 1 | < 0.1% |
| 6 | 13 | 0.2% |
| 5 | 36 | 0.4% |
| 4 | 160 | 1.9% |
| 3 | 575 | 6.7% |
| 2 | 1771 | |
| 1 | 3507 | |
| 0 | 2457 |
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.325627787 |
| Minimum | 0 |
|---|---|
| Maximum | 18 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 5 |
| median | 6 |
| Q3 | 8 |
| 95-th percentile | 10 |
| Maximum | 18 |
| Range | 18 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.240587057 |
|---|---|
| Coefficient of variation (CV) | 0.3542078561 |
| Kurtosis | 0.4632880903 |
| Mean | 6.325627787 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.3994642361 |
| Sum | 53907 |
| Variance | 5.020230359 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 1543 | |
| 5 | 1382 | |
| 7 | 1351 | |
| 8 | 1065 | |
| 4 | 988 | |
| 9 | 709 | |
| 3 | 533 | 6.3% |
| 10 | 373 | 4.4% |
| 2 | 240 | 2.8% |
| 11 | 166 | 1.9% |
| Other values (9) | 172 | 2.0% |
| Value | Count | Frequency (%) |
| 0 | 2 | < 0.1% |
| 1 | 33 | 0.4% |
| 2 | 240 | 2.8% |
| 3 | 533 | 6.3% |
| 4 | 988 | |
| 5 | 1382 | |
| 6 | 1543 | |
| 7 | 1351 | |
| 8 | 1065 | |
| 9 | 709 |
| Value | Count | Frequency (%) |
| 18 | 3 | < 0.1% |
| 17 | 2 | < 0.1% |
| 16 | 4 | < 0.1% |
| 15 | 7 | 0.1% |
| 14 | 15 | 0.2% |
| 13 | 22 | 0.3% |
| 12 | 84 | 1.0% |
| 11 | 166 | 1.9% |
| 10 | 373 | |
| 9 | 709 |
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.38324337 |
| Minimum | 0 |
|---|---|
| Maximum | 21 |
| Zeros | 174 |
| Zeros (%) | 2.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 8 |
| Maximum | 21 |
| Range | 21 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.260635185 |
|---|---|
| Coefficient of variation (CV) | 0.5157448479 |
| Kurtosis | 2.496012581 |
| Mean | 4.38324337 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.9577371823 |
| Sum | 37354 |
| Variance | 5.110471442 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 1709 | |
| 3 | 1518 | |
| 5 | 1434 | |
| 2 | 1051 | |
| 6 | 923 | |
| 7 | 602 | 7.1% |
| 1 | 409 | 4.8% |
| 8 | 302 | 3.5% |
| 9 | 177 | 2.1% |
| 0 | 174 | 2.0% |
| Other values (11) | 223 | 2.6% |
| Value | Count | Frequency (%) |
| 0 | 174 | 2.0% |
| 1 | 409 | 4.8% |
| 2 | 1051 | |
| 3 | 1518 | |
| 4 | 1709 | |
| 5 | 1434 | |
| 6 | 923 | |
| 7 | 602 | 7.1% |
| 8 | 302 | 3.5% |
| 9 | 177 | 2.1% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 18 | 3 | < 0.1% |
| 17 | 2 | < 0.1% |
| 16 | 7 | 0.1% |
| 15 | 8 | 0.1% |
| 14 | 4 | < 0.1% |
| 13 | 15 | 0.2% |
| 12 | 22 | |
| 11 | 47 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1525463506 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 7697 |
| Zeros (%) | 90.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.5512880591 |
|---|---|
| Coefficient of variation (CV) | 3.613905261 |
| Kurtosis | 26.12203376 |
| Mean | 0.1525463506 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.712719157 |
| Sum | 1300 |
| Variance | 0.3039185241 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 7697 | |
| 1 | 538 | 6.3% |
| 2 | 168 | 2.0% |
| 3 | 61 | 0.7% |
| 4 | 48 | 0.6% |
| 5 | 9 | 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 7697 | |
| 1 | 538 | 6.3% |
| 2 | 168 | 2.0% |
| 3 | 61 | 0.7% |
| 4 | 48 | 0.6% |
| 5 | 9 | 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6 | 1 | < 0.1% |
| 5 | 9 | 0.1% |
| 4 | 48 | 0.6% |
| 3 | 61 | 0.7% |
| 2 | 168 | 2.0% |
| 1 | 538 | 6.3% |
| 0 | 7697 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.432292889 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 5797 |
| Zeros (%) | 68.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.7058553534 |
|---|---|
| Coefficient of variation (CV) | 1.632817405 |
| Kurtosis | 2.510863834 |
| Mean | 0.432292889 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.619426102 |
| Sum | 3684 |
| Variance | 0.4982317799 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5797 | |
| 1 | 1858 | 21.8% |
| 2 | 796 | 9.3% |
| 3 | 54 | 0.6% |
| 4 | 14 | 0.2% |
| 5 | 2 | < 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 5797 | |
| 1 | 1858 | 21.8% |
| 2 | 796 | 9.3% |
| 3 | 54 | 0.6% |
| 4 | 14 | 0.2% |
| 5 | 2 | < 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6 | 1 | < 0.1% |
| 5 | 2 | < 0.1% |
| 4 | 14 | 0.2% |
| 3 | 54 | 0.6% |
| 2 | 796 | 9.3% |
| 1 | 1858 | 21.8% |
| 0 | 5797 |
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5848392396 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 5305 |
| Zeros (%) | 62.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.9107039433 |
|---|---|
| Coefficient of variation (CV) | 1.557186799 |
| Kurtosis | 4.862347898 |
| Mean | 0.5848392396 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.912452117 |
| Sum | 4984 |
| Variance | 0.8293816723 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5305 | |
| 1 | 1944 | 22.8% |
| 2 | 948 | 11.1% |
| 3 | 203 | 2.4% |
| 4 | 90 | 1.1% |
| 5 | 21 | 0.2% |
| 6 | 9 | 0.1% |
| 9 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 5305 | |
| 1 | 1944 | 22.8% |
| 2 | 948 | 11.1% |
| 3 | 203 | 2.4% |
| 4 | 90 | 1.1% |
| 5 | 21 | 0.2% |
| 6 | 9 | 0.1% |
| 7 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 6 | 9 | 0.1% |
| 5 | 21 | 0.2% |
| 4 | 90 | 1.1% |
| 3 | 203 | 2.4% |
| 2 | 948 | 11.1% |
| 1 | 1944 | 22.8% |
| 0 | 5305 |
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.08659939 |
| Minimum | 0 |
|---|---|
| Maximum | 11 |
| Zeros | 172 |
| Zeros (%) | 2.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 11 |
| Range | 11 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.239725508 |
|---|---|
| Coefficient of variation (CV) | 0.4016476877 |
| Kurtosis | 0.5662623963 |
| Mean | 3.08659939 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.07097975018 |
| Sum | 26304 |
| Variance | 1.536919335 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 2793 | |
| 4 | 2232 | |
| 2 | 1775 | |
| 5 | 728 | 8.5% |
| 1 | 637 | 7.5% |
| 0 | 172 | 2.0% |
| 6 | 146 | 1.7% |
| 7 | 23 | 0.3% |
| 8 | 14 | 0.2% |
| 11 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 172 | 2.0% |
| 1 | 637 | 7.5% |
| 2 | 1775 | |
| 3 | 2793 | |
| 4 | 2232 | |
| 5 | 728 | 8.5% |
| 6 | 146 | 1.7% |
| 7 | 23 | 0.3% |
| 8 | 14 | 0.2% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 11 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 8 | 14 | 0.2% |
| 7 | 23 | 0.3% |
| 6 | 146 | 1.7% |
| 5 | 728 | 8.5% |
| 4 | 2232 | |
| 3 | 2793 | |
| 2 | 1775 | |
| 1 | 637 | 7.5% |
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.35590237 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 3698 |
| Zeros (%) | 43.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.50083293 |
|---|---|
| Coefficient of variation (CV) | 1.10688864 |
| Kurtosis | -0.3044632404 |
| Mean | 1.35590237 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.8356288165 |
| Sum | 11555 |
| Variance | 2.252499485 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3698 | |
| 2 | 1668 | |
| 1 | 1327 | 15.6% |
| 4 | 1028 | 12.1% |
| 3 | 638 | 7.5% |
| 5 | 125 | 1.5% |
| 6 | 30 | 0.4% |
| 8 | 5 | 0.1% |
| 7 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 3698 | |
| 1 | 1327 | 15.6% |
| 2 | 1668 | |
| 3 | 638 | 7.5% |
| 4 | 1028 | 12.1% |
| 5 | 125 | 1.5% |
| 6 | 30 | 0.4% |
| 7 | 3 | < 0.1% |
| 8 | 5 | 0.1% |
| Value | Count | Frequency (%) |
| 8 | 5 | 0.1% |
| 7 | 3 | < 0.1% |
| 6 | 30 | 0.4% |
| 5 | 125 | 1.5% |
| 4 | 1028 | 12.1% |
| 3 | 638 | 7.5% |
| 2 | 1668 | |
| 1 | 1327 | 15.6% |
| 0 | 3698 |
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.05573809 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 1608 |
| Zeros (%) | 18.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.569966792 |
|---|---|
| Coefficient of variation (CV) | 0.7636998119 |
| Kurtosis | -0.1770778602 |
| Mean | 2.05573809 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.5599668485 |
| Sum | 17519 |
| Variance | 2.464795729 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 2161 | |
| 1 | 1786 | |
| 0 | 1608 | |
| 3 | 1327 | |
| 4 | 998 | |
| 5 | 458 | 5.4% |
| 6 | 150 | 1.8% |
| 7 | 28 | 0.3% |
| 8 | 3 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1608 | |
| 1 | 1786 | |
| 2 | 2161 | |
| 3 | 1327 | |
| 4 | 998 | |
| 5 | 458 | 5.4% |
| 6 | 150 | 1.8% |
| 7 | 28 | 0.3% |
| 8 | 3 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 10 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
| 8 | 3 | < 0.1% |
| 7 | 28 | 0.3% |
| 6 | 150 | 1.8% |
| 5 | 458 | 5.4% |
| 4 | 998 | |
| 3 | 1327 | |
| 2 | 2161 | |
| 1 | 1786 |
Auto
The auto setting is an easily interpretable pairwise column metric of the following mapping: vartype-vartype : method, categorical-categorical : Cramer's V, numerical-categorical : Cramer's V (using a discretized numerical column), numerical-numerical : Spearman's ρ. This configuration uses the best suitable for each pair of columns.Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | NumValenceElectrons | NumRadicalElectrons | HeavyAtomCount | NHOHCount | NOCount | NumAliphaticCarbocycles | NumAromaticCarbocycles | NumAromaticHeterocycles | NumAliphaticHeterocycles | NumAliphaticRings | NumAromaticRings | NumHAcceptors | NumHDonors | NumHeteroatoms | NumRotatableBonds | NumSaturatedCarbocycles | NumSaturatedHeterocycles | NumSaturatedRings | RingCount | fr_Ar_N | fr_NH0 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 5124 | 50 | 0 | 9 | 1 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 1 | 4 | 3 | 0 | 0 | 0 | 0 | 0 | 1 |
| 1 | 8178 | 124 | 0 | 24 | 1 | 8 | 0 | 1 | 1 | 1 | 1 | 2 | 7 | 1 | 9 | 6 | 0 | 0 | 0 | 3 | 2 | 3 |
| 2 | 7753 | 116 | 0 | 22 | 0 | 6 | 0 | 1 | 1 | 1 | 1 | 2 | 6 | 0 | 7 | 4 | 0 | 0 | 0 | 3 | 4 | 5 |
| 3 | 3392 | 162 | 0 | 30 | 2 | 6 | 0 | 1 | 1 | 2 | 2 | 2 | 5 | 2 | 11 | 4 | 0 | 1 | 1 | 4 | 2 | 2 |
| 4 | 1761 | 122 | 0 | 22 | 0 | 5 | 0 | 1 | 0 | 2 | 2 | 1 | 4 | 0 | 6 | 4 | 0 | 2 | 2 | 3 | 0 | 2 |
| 5 | 6014 | 126 | 0 | 24 | 0 | 2 | 0 | 2 | 0 | 2 | 2 | 2 | 4 | 0 | 4 | 2 | 0 | 1 | 1 | 4 | 0 | 2 |
| 6 | 6047 | 162 | 0 | 32 | 0 | 5 | 0 | 3 | 1 | 2 | 2 | 4 | 5 | 0 | 5 | 4 | 0 | 1 | 1 | 6 | 1 | 2 |
| 7 | 6288 | 48 | 0 | 9 | 2 | 5 | 0 | 0 | 1 | 0 | 0 | 1 | 3 | 2 | 5 | 0 | 0 | 0 | 0 | 1 | 3 | 1 |
| 8 | 1012 | 138 | 0 | 28 | 0 | 6 | 1 | 2 | 2 | 0 | 1 | 4 | 6 | 0 | 7 | 4 | 1 | 0 | 1 | 5 | 4 | 4 |
| 9 | 2286 | 106 | 0 | 21 | 0 | 6 | 0 | 1 | 2 | 0 | 0 | 3 | 6 | 0 | 6 | 4 | 0 | 0 | 0 | 3 | 4 | 4 |
Last rows
| df_index | NumValenceElectrons | NumRadicalElectrons | HeavyAtomCount | NHOHCount | NOCount | NumAliphaticCarbocycles | NumAromaticCarbocycles | NumAromaticHeterocycles | NumAliphaticHeterocycles | NumAliphaticRings | NumAromaticRings | NumHAcceptors | NumHDonors | NumHeteroatoms | NumRotatableBonds | NumSaturatedCarbocycles | NumSaturatedHeterocycles | NumSaturatedRings | RingCount | fr_Ar_N | fr_NH0 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 8512 | 4615 | 80 | 0 | 15 | 3 | 5 | 0 | 1 | 0 | 0 | 0 | 1 | 3 | 3 | 5 | 5 | 0 | 0 | 0 | 1 | 0 | 0 |
| 8513 | 9434 | 130 | 0 | 26 | 1 | 6 | 0 | 3 | 0 | 0 | 0 | 3 | 4 | 1 | 6 | 5 | 0 | 0 | 0 | 3 | 0 | 1 |
| 8514 | 8731 | 188 | 0 | 35 | 0 | 9 | 0 | 2 | 1 | 1 | 1 | 3 | 8 | 0 | 11 | 9 | 0 | 1 | 1 | 4 | 2 | 4 |
| 8515 | 2006 | 130 | 0 | 26 | 0 | 7 | 0 | 1 | 3 | 0 | 0 | 4 | 8 | 0 | 8 | 4 | 0 | 0 | 0 | 4 | 4 | 4 |
| 8516 | 11103 | 126 | 0 | 25 | 1 | 9 | 0 | 2 | 0 | 1 | 1 | 2 | 6 | 1 | 9 | 4 | 0 | 0 | 0 | 3 | 0 | 2 |
| 8517 | 8769 | 96 | 0 | 19 | 2 | 4 | 0 | 1 | 2 | 0 | 0 | 3 | 3 | 2 | 5 | 3 | 0 | 0 | 0 | 3 | 2 | 1 |
| 8518 | 5247 | 182 | 0 | 38 | 4 | 6 | 0 | 4 | 2 | 0 | 0 | 6 | 6 | 2 | 6 | 5 | 0 | 0 | 0 | 6 | 4 | 4 |
| 8519 | 3835 | 114 | 0 | 20 | 1 | 3 | 0 | 1 | 0 | 0 | 0 | 1 | 2 | 1 | 3 | 4 | 0 | 0 | 0 | 1 | 0 | 1 |
| 8520 | 10980 | 122 | 0 | 22 | 0 | 5 | 0 | 0 | 1 | 0 | 0 | 1 | 4 | 0 | 9 | 4 | 0 | 0 | 0 | 1 | 1 | 2 |
| 8521 | 3473 | 102 | 0 | 20 | 1 | 5 | 0 | 1 | 1 | 0 | 0 | 2 | 4 | 1 | 5 | 3 | 0 | 0 | 0 | 2 | 1 | 1 |